Pitch dependent phone modelling for HMM-based speech recognition.
Similar resources
Distributed Speech Recognition HMM Modelling
Speech recognition performed over a network is referred to as distributed speech recognition (DSR). It is a technique that enables access to services and communication systems without the need to type or use a keypad. The primary objective of speech recognition is to enable all of us to have easy access to the full range of computer services and communication systems, without the need for all o...
Confidence measures for HMM-based speech recognition
In this paper, we describe our work in the field of confidence measures for HMM-based speech recognition. Confidence measures are a means of estimating the recognition reliability for single words of the recognizer output. The possible applications of such measures are manifold. We present our experiments with well-known approaches and propose some new ones. Particularly, we propose to combine ...
SVR vs MLP for Phone Duration Modelling in HMM-based Speech Synthesis
In this paper we investigate external phone duration models (PDMs) for improving the quality of synthetic speech in hidden Markov model (HMM)-based speech synthesis. Support Vector Regression (SVR) and Multilayer Perceptron (MLP) were used for this task. SVR and MLP PDMs were compared with the explicit duration modelling of hidden semi-Markov models (HSMMs). Experiments done on an American Engl...
Phone set selection for HMM-based dialect speech synthesis
This paper describes a method for selecting an appropriate phone set in dialect speech synthesis for a so far undescribed dialect by applying hidden Markov model (HMM) based training and clustering methods. In this pilot study we show how a phone set derived from the phonetic surface can be optimized given a small amount of dialect speech training data.
A HMM-based recognition system for perceptive relevant pitch movements of spontaneous German speech
This paper presents an HMM-based recognition system for perceptive relevant pitch movements of spontaneous German speech. The pitch movements are defined according to the perceptively and phonetically motivated IPO-approach to intonation. For recognition we use a hybrid approach combining polynomial classification with Hidden Markov Modelling. The recognition is based only on the speech signal, i...
Journal
Journal title: Journal of the Acoustical Society of Japan (E)
Year: 1994
ISSN: 0388-2861, 2185-3509
DOI: 10.1250/ast.15.77